Scaling Up the Accuracy of Naive Bayes Classi ers a Decision Tree Hybrid
نویسنده
چکیده
Naive Bayes induction algorithms were previously shown to be surprisingly accurate on many classi cation tasks even when the conditional independence assumption on which they are based is violated How ever most studies were done on small databases We show that in some larger databases the accuracy of Naive Bayes does not scale up as well as decision trees We then propose a new algorithm NBTree which in duces a hybrid of decision tree classi ers and Naive Bayes classi ers the decision tree nodes contain uni variate splits as regular decision trees but the leaves contain Naive Bayesian classi ers The approach re tains the interpretability of Naive Bayes and decision trees while resulting in classi ers that frequently out perform both constituents especially in the larger databases tested
منابع مشابه
Scaling Up the Accuracy of Naive-Bayes Classifiers: A Decision-Tree Hybrid
Naive-Bayes induction algorithms were previously shown to be surprisingly accurate on many classii-cation tasks even when the conditional independence assumption on which they are based is violated. However , most studies were done on small databases. We show that in some larger databases, the accuracy of Naive-Bayes does not scale up as well as decision trees. We then propose a new algorithm, ...
متن کاملCalibrated Lazy Associative Classification
Classi cation is an important problem in data mining. Given an example x and a class c, a classi er usually works by estimating the probability of x being member of c (i.e., membership probability). Well calibrated classi ers are those able to provide accurate estimates of class membership probabilities, that is, the estimated probability p̂(c|x) is close to p(c|p̂(c|x)), which is the true, em...
متن کاملEective supra-classi®ers for knowledge base construction
We explore the use of the supra-classi®er framework in the construction of a classi®er knowledge base. Previously, we introduced this framework within which labels produced by old classi®ers are used to improve the generalization performance of a new classi®er for a dierent but related classi®cation task (Bollacker and Ghosh, 1998). We showed empirically that a simple Hamming nearest neighbor ...
متن کاملSearching for Dependencies in Bayesian Classifiers
Naive Bayesian classi ers which make independence assumptions perform remarkably well on some data sets but poorly on others. We explore ways to improve the Bayesian classi er by searching for dependencies among attributes. We propose and evaluate two algorithms for detecting dependencies among attributes and show that the backward sequential elimination and joining algorithm provides the most ...
متن کاملImproving Naive Bayes Classiiers Using Neuro-fuzzy Learning 1
Naive Bayes classi ers are a well-known and powerful type of classi ers that can easily be induced from a dataset of sample cases. However, the strong conditional independence and distribution assumptions underlying them can sometimes lead to poor classi cation performance. Another prominent type of classi ers are neuro-fuzzy classi cation systems, which derive (fuzzy) classi ers from data usin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996